Replica placement for p2p redundant data storage on unreliable, non-dedicated machines

نویسندگان

  • Piotr Skowron
  • Krzysztof Rzadca
چکیده

P2P architecture appears to fit for enterprise backup. In contrast to dedicated backup servers, nowadays a standard solution, making backups directly on organization’s workstations should be cheaper (as existing hardware is used) and more efficient (as there is no single bottleneck server). However, non-dedicated machines cause other challenges. Update propagation algorithms must take into account frequent transient failures (computers turned off for nights or weekends). Hardware and network topology are heterogeneous. A replication system must not only handle resulting constraints (such as e.g. varying amount of disc space), but also take advantage of the diversity (and e.g., distribute replicas geographically). We present a p2p backup system using pairwise replication contracts between a data owner and a replicator (storing one of the replicas). In contrast to storing the data directly in a DHT, the contracts allow the system to optimize replicas’ placement depending on features of the data owner, the replicators and the network topology. To cope with replicators’ transient failures, the system propagates notification about the updates through asynchronous messages. Each peer has a few synchro-peers that, in case the peer is off-line, store these notifications. Once back online, the peer contacts its synchro-peers, and then explicitly requests latest versions of modified files it replicates from the owner, or from other replicas. We implemented the system and tested it on PlanetLab as well as 150 computers in the faculty’s student computer labs. We experimentally assess the cost of unavailability showing how the time needed for data backup increases non-linearly with machines’ unavailability. Individual workstations are certainly less reliable and less efficient than a dedicated backup server. Yet, as our experiments show, given a sufficient number of such workstations, a p2p backup system outperforms a centralized solution, benefiting from better scalability and distributed topology. P2P is a cheap and robust alternative for enterprise backup.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Flexible replica placement for optimized P2P backup on heterogeneous, unreliable machines

P2P architecture is a viable option for enterprise backup. In contrast to dedicated backup servers, nowadays a standard solution, making backups directly on organization’s workstations should be cheaper as existing hardware is used; more efficient as there is no single bottleneck server; and more reliable as the machines can be geographically dispersed. We present an architecture of a p2p backu...

متن کامل

Lifetime-Aware Replication for Data Durability in P2P Storage Network

Many p2p based wide-area storage networks have been proposed to provide scalable storage services by combining the idle resources of many unreliable nodes. These storage networks can also provide highly available and reliable storage services, by replicating each data on several nodes. The popular approach is availability based replication which uses individual node availability. However, some ...

متن کامل

Improve Replica Placement in Content Distribution Networks with Hybrid Technique

The increased using of the Internet and its accelerated growth leads to reduced network bandwidth and the capacity of servers; therefore, the quality of Internet services is unacceptable for users while the efficient and effective delivery of content on the web has an important role to play in improving performance. Content distribution networks were introduced to address this issue. Replicatin...

متن کامل

Route Redirection in Structured Network Using Chord

In a structured peer-to-peer (p2p) network, Distributed Hash Tables (DHT) are maintained by nodes to store information and to route the packets. But sometimes a node can be attacked and so it may deny the DHT key access or misroute the packets. Replica Placement is a key mechanism to overcome these problems by placing replica for nodes in p2p network. MAXDISJOINT replica placement strategy crea...

متن کامل

Max Percentile Replication for Optimal Performance in Multi-Regional P2P VoD Systems- Technical Report

Peer-to-peer based (P2P) VoD systems have proven to be an effective solution for scalable video distribution. In P2P VoD, each peer contributes storage to replicate videos and assist video delivery. A fundamental question is how to optimally replicate video content across the peers so as to maximize their upload capacity. We study this question within the context of a large-scale P2P network wh...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1212.0427  شماره 

صفحات  -

تاریخ انتشار 2012